Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Adapting a Robust Multi-genre NE System for Automatic Content Extraction

Identifieur interne : 001826 ( Main/Exploration ); précédent : 001825; suivant : 001827

Adapting a Robust Multi-genre NE System for Automatic Content Extraction

Auteurs : Diana Maynard [Royaume-Uni] ; Hamish Cunningham [Royaume-Uni] ; Kalina Bontcheva [Royaume-Uni] ; Marin Dimitrov [Bulgarie]

Source :

RBID : ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF

Abstract

Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.

Url:
DOI: 10.1007/3-540-46148-5_27


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<author>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
</author>
<author>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
</author>
<author>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
</author>
<author>
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF</idno>
<date when="2002" year="2002">2002</date>
<idno type="doi">10.1007/3-540-46148-5_27</idno>
<idno type="url">https://api.istex.fr/document/50643758F6A5345504D4B37A8BBA39C828D900BF/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000860</idno>
<idno type="wicri:Area/Istex/Curation">000850</idno>
<idno type="wicri:Area/Istex/Checkpoint">000F40</idno>
<idno type="wicri:doubleKey">0302-9743:2002:Maynard D:adapting:a:robust</idno>
<idno type="wicri:Area/Main/Merge">001906</idno>
<idno type="wicri:Area/Main/Curation">001826</idno>
<idno type="wicri:Area/Main/Exploration">001826</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Adapting a Robust Multi-genre NE System for Automatic Content Extraction</title>
<author>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield</wicri:regionArea>
<wicri:noRegion>Sheffield</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield</wicri:regionArea>
<wicri:noRegion>Sheffield</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>Dept of Computer Science, University of Sheffield, 211 Portobello St, S1 4DP, Sheffield</wicri:regionArea>
<wicri:noRegion>Sheffield</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Royaume-Uni</country>
</affiliation>
</author>
<author>
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Bulgarie</country>
<wicri:regionArea>Sirma AI Ltd, Ontotext Lab, 38AHristo Botev Blvd, 1000, Sofia</wicri:regionArea>
<placeName>
<settlement type="city">Sofia</settlement>
<region nuts="2">Sofia-ville (oblast)</region>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Bulgarie</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2002</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">50643758F6A5345504D4B37A8BBA39C828D900BF</idno>
<idno type="DOI">10.1007/3-540-46148-5_27</idno>
<idno type="ChapterID">27</idno>
<idno type="ChapterID">Chap27</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Many current information extraction systems tend to be designed with particular applications and domains in mind. With the increasing need for robust language engineering tools which can handle a variety of language processing demands, we have used the GATE architecture to design MUSE - a system for named entity recognition and related tasks. In this paper, we address the issue of how this general-purpose system can be adapted for particular applications with minimal time and effort, and how the set of resources used can be adapted dynamically and automatically. We focus specifically on the challenges of the ACE (Automatic Content Extraction) entity detection and tracking task, and preliminary results show promising figures.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Bulgarie</li>
<li>Royaume-Uni</li>
</country>
<region>
<li>Sofia-ville (oblast)</li>
</region>
<settlement>
<li>Sofia</li>
</settlement>
</list>
<tree>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
</noRegion>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
<name sortKey="Bontcheva, Kalina" sort="Bontcheva, Kalina" uniqKey="Bontcheva K" first="Kalina" last="Bontcheva">Kalina Bontcheva</name>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
<name sortKey="Cunningham, Hamish" sort="Cunningham, Hamish" uniqKey="Cunningham H" first="Hamish" last="Cunningham">Hamish Cunningham</name>
<name sortKey="Maynard, Diana" sort="Maynard, Diana" uniqKey="Maynard D" first="Diana" last="Maynard">Diana Maynard</name>
</country>
<country name="Bulgarie">
<region name="Sofia-ville (oblast)">
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
</region>
<name sortKey="Dimitrov, Marin" sort="Dimitrov, Marin" uniqKey="Dimitrov M" first="Marin" last="Dimitrov">Marin Dimitrov</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001826 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001826 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:50643758F6A5345504D4B37A8BBA39C828D900BF
   |texte=   Adapting a Robust Multi-genre NE System for Automatic Content Extraction
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024